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C^") Abstract. A random group contains many quasiconvex surface subgroups. 

o 

(N 

o ! 1. Introduction 

"^-h Gromov, motivated perhaps by the Virtual Haken Conjecture in 3-manifold 

topology (now a theorem of Agol [T|), famously asked the following 

i— i Surface Subgroup Question. Let G be a one-ended hyperbolic group. Does G 

Ql^ contain a subgroup isomorphic to the fundamental group of a closed surface with 

O X <0? 

tG Beyond its intrinsic appeal, one reason Gromov was interested in this question 

was a belief that such surface subgroups could be used as essential structural com- 
C ponents of hyperbolic groups [S]. Our interest in this question is stimulated by 

i_i a belief that surface groups (not necessarily closed, and possibly with boundary) 

can act as a sort of "bridge" between hyperbolic geometry and symplectic geometry 
k^ (through their connection to causal structures, quasimorphisms, stable commutator 

qq length, etc.). 

qq Despite receiving considerable attention the Surface Subgroup Question is wide 

r—i open in general, although in the specific case of hyperbolic 3-manifold groups it 

was positively resolved by Kahn-Markovic [ID] . Their argument depends crucially 

■^J- on the structure of such groups as lattices in the semisimple Lie group PSL(2,C). 

By contrast, in this paper we are concerned with much more combinatorial classes 
of hyperbolic groups. Nevertheless, one common point of contact between our 
approach and the approach of Kahn-Markovic is the use of probability theory to 
construct surfaces, and the use of (hyperbolic) geometry to certify them as injective. 
In particular, because our surfaces are certified as injective by local methods, they 
jjj end up being quasiconvex. It is an interesting question to identify the class of 

hyperbolic groups which contain non-quasiconvex (yet injective) surface subgroups 
(hyperbolic 3-manifold groups are now known to contain such groups since they are 
virtually fibered, again by Agol [I]). 

I n [H]) § 9 (also see |13p. Gromov introduced the notion of a random group, i.e. 
a group with fixed generating set and relations chosen randomly from the set of 
all (cyclically reduced) words of some length n with a fixed logarithmic density D. 
Properties of these groups are then shown to hold with probability going to 1 as n — ¥ 
oo; informally one says that a random group at density D has a certain property 
with overwhelming probability. Gromov showed that at density D > 1/2 such 
groups are trivial or Z/2Z, whereas at density D < 1/2 they are infinite, hyperbolic, 
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and two-dimensional (with overwhelming probability). Later, Dahmani-Guirardel- 
Przytycki [7] showed that they are one-ended and do not split, and therefore (by 
the classification of boundaries of hyperbolic 2-dimensional groups), have a Menger 
sponge as a boundary. 

Random groups at density D < 1/6 are known to be cubulated (i.e. are equal 
to the fundamental groups of nonpositively curved compact cube complexes), and 
at density D < 1/5 to act cocompactly (but not necessarily properly) on a CAT(O) 
cube complex, by Ollivier-Wise [14] ■ On the other hand, groups at density 1/3 < 
D < 1/2 have property (T), by Zuk pD6] (further clarified by Kotowski-Kotowski 
[11]). and therefore cannot act on a CAT(O) cube complex without a global fixed 
point. A one-ended hyperbolic cubulated group contains a one-ended graph of free 
groups (see [5J, Appendix A; this depends on work of Agol [T]), and Calegari- Wilton 
[Hj show that a random graph of free groups (i.e. a graph of free groups with random 
homomorphisms from edge groups to vertex groups) contains a surface subgroup. 
Thus one might hope that a random group at density D < 1/6 should contain a 
graph of free groups that is "random enough" so that the main theorem of [6] can 
be applied, and one can conclude that there is a surface subgroup. 

Though suggestive, there does not appear to be an easy strategy to flesh out 
this idea. Nevertheless in this paper we are able to show directly that at any 
density D < 1/2 a random group contains a surface subgroup (in fact, many surface 
subgroups). 

We give three proofs of this theorem, valid at different densities, with the final 
proof giving any density D < 1/2. Theorem |5.2.4 is valid for one-relator groups 
(informally D — 0), Lemma 6.2.2 gives D < 2/7, while our main Theorem 6.4.1 
gives D < 1/2. Explicitly, we show: 



Surfaces in Random Groups |6.4.1] A random group of length n and density 
D < 1/2 contains a surface subgroup with probability 1 — 0(eT n ). In fact, it 
contains 0(e n ) surfaces of genus 0(n). Moreover, these surfaces are quasiconvex. 

This state of affairs is summarized in Figure [Tl A modification of the con- 
struction (see Remark 6.4.2 ) shows that the surface subgroups can be taken to be 
homologically essential. 
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Figure 1. Random groups at different densities 



Along the way we prove some results of independent interest. The first of these 
(and the most technically involved part of the paper) is the Thin Fatgraph Theorem, 
which says that a "sufficiently random" homologically trivial collection of cyclic 
words in a free group satisfies a strong combinatorial property: it can be realized 
as the boundary of a trivalent fatgraph in which every edge is longer than some 
prescribed constant: 
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Thin Fatgraph Theorem |3.3.1[ For all L > 0, for any T > L and any < 

e <C 1/T, there is an N depending only on L so that if T is a homologically trivial 
collection of tagged loops such that for each loop 7 in T: 

(1) no two tags in 7 are closer than 4L; 

(2) the density of the tags in 7 is at most e/L; 

(3) 7 is (T, e) -pseudorandom; 

then there exists a trivalent fatgraph Y with every edge of length at least L so that 
dS(Y) is equal to N disjoint copies ofT. 

If the rank of the group is 2, we can take N = 1 above; otherwise we can take 
N = 20L. 

The Thin Fatgraph Theorem strengthens one of the main technical theorems 
underpinning [5] and [B], and can be thought of as a kind of L°° theorem whose L 1 
version (with optimal constants) is the main theorem of [3] . If r is a long random 
relator, the Thin Fatgraph Theorem lets us build a surface whose boundary consists 
of a small number of copies of r and r _1 . By plugging in a disk along each boundary 
component, we obtain a closed surface in the one-relator group (Fk \ r). If the 
surface is built correctly, it can be shown to be 7Ti-injective, with high probability. 
This is one of the most subtle parts of the construction, and ensuring that the 
surfaces we build are 7Ti-injective at this step depends on the existence of a so- 
called Bead Decomposition for r; see Lemma |5.2.2| Thus we obtain the Random 
One Relator Theorem, whose statement is as follows: 

Random One Relator Theorem |5.2.4[ Fix a free group Fk and let r be a random 
cyclically reduced word of length n. Then G := (Ff. | r) contains a quasiconvex 
surface subgroup with probability 1 — 0(e~ n ). 

The surfaces stay injective as more and more relators are added (in fact, these 
are the surfaces referred to in the main theorem) so this shows that random groups 
in the few relators model also contain surface subgroups, with high probability. 

There is an interesting tension here: the fewer relators, the harder it is to build 
a surface group, but the easier it is to show that it is injective. This suggests 
looking for surface subgroups in an arbitrary one-ended hyperbolic group at a very 
specific "intermediate" scale, perhaps at the scale O(S) where 6 is the constant of 
hyperbolicity with respect to an "efficient" (e.g. Dehn) presentation. 

1.1. Acknowledgments. We would like to thank Misha Gromov, John Mackay, 
Yann Ollivier, Piotr Przytycki and Henry Wilton. We also would like to acknowl- 
edge the use of Colin Rourke's pinlabel program, and Nathan Dunfield's labelpin 
program to help add the (numerous!) labels to the figures. Danny Calegari was 
supported by NSF grant DMS 1005246, and Alden Walker was supported by NSF 
grant DMS 1203888. 

2. Background 

In this section we describe some of the standard combinatorial language that we 
use in the remainder of the paper. Most important is the notion of foldedness for a 
map between graphs, as developed by Stallings [15]. We also recall some standard 
elements of the theory of small cancellation, which it is convenient to cite at certain 
points in our argument, though ultimately we depend on a more flexible version of 
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small cancellation theory developed by Ollivier [12] specifically for application to 



random groups (his results are summarized in § 6.1). 

2.1. Fatgraphs and foldedness. 

Definition 2.1.1. Let X and Y be graphs. A map / : Y — >• X is simplicial if it 
takes edges (linearly) to edges. It is folded if it is locally injective. 

A folded map between graphs is injective on -k\. The terminology of foldedness, 
and its first effective use as a tool in group theory, is due to Stallings [T3] ■ 

Definition 2.1.2. A fatgraph is a graph Y together with a choice of cyclic order 
on the edges incident to each vertex. A fatgraph admits a canonical fattening to a 
surface S(Y) in which it sits as a spine (so that S(Y) deformation retracts to Y) in 
such a way that the cyclic order of edges coming from the fatgraph structure agrees 
with the cyclic order in which the edges appear in S(Y). A folded fatgraph over X 
is a fatgraph Y together with a folded map / :Y —> X. 

The case of most interest to us will be that X is a rose associated to a free 
generating set for a (finitely generated) free group F. 

A folded fatgraph / : Y —>■ X induces a 7i"i injective map S(Y) —> X. The 
deformation retraction S(Y) — > Y induces an immersion dS(Y) — > Y , and we may 
therefore think of dS(Y) as a union of simplicial loops. Under / these loops map 
to immersed loops in X, corresponding to conjugacy classes in m(X). 

Conversely, given a homologically trivial collection of conjugacy classes T in 
m(X) represented (uniquely) by immersed oriented loops in X, we may ask whether 
there is a folded fatgraph Y over X so that dS(Y) represents V (by abuse of 
notation, we write dS(Y) = T). Informally, we say that such a T bounds a folded 
fatgraph. 

2.2. Small cancellation. 

Definition 2.2.1. Let G have a presentation 

G := (xi,- ■■ ,x n | riv ,r s ) 

where the Tj are cyclically reduced words in the generators x i . A piece is a subword 
that appears in two different ways in the relations or their inverses. A presentation 
satisfies the condition C"(A) for some A if every piece a in some r^ satisfies |c|/|r,| < 
A. 

Remark 2.2.2. Some authors use the notation C"(A) to indicate the weaker inequal- 
ity |c|/|r,| < A. This distinction will be irrelevant for us. 

Associated to a presentation there is a connected 2-complex K with one vertex, 
one edge for each generator, and one disk for each relation. The 1-skeleton X for K 
is a rose for the free group on the generators. As is well-known, a group satisfying 
C"(l/6) is hyperbolic, and the 2-complex K is aspherical (so that the group is of 
cohomological dimension at most 2). 

Definition 2.2.3. Fix a group G with a presentation complex K and 1-skeleton X 
as above. A surface over the presentation is an oriented surface S with the structure 
of a cell complex together with a cellular map to K which is an isomorphism on 
each cell. The 1-skeleton Y of the CW complex structure on S inherits the structure 
of a fatgraph from S and its orientation, and this fatgraph comes together with a 
map to X. We say S has a folded spine if Y —¥ X is a folded fatgraph. 
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If G is a small cancellation group, a surface with a folded spine can be certified 
as 7Ti injective by the following combinatorial condition. 

Definition 2.2.4. Let G be a group with a fixed presentation, and let S be an 
oriented surface over the presentation with a folded spine Y . We say S is a- convex 
(for some a > 0) with respect to the presentation if for every immersed path 7 in 
Y which is a piece in some relation r\ with |7|/|t"j| > a, we actually have that 7 is 
contained in dS(Y) (i.e. it is in the boundary of a disk of S). 

Lemma 2.2.5 (Injective surface). Let G be a group with a presentation satisfying 
C"(l/6), and let S be an oriented surface over the presentation with a folded spine 
Y . If S is 1/2-convex then it is injective. Moreover, if S is a-convex for any 
a < 1/2 then it is guasiconvex. 

Proof. First we prove injectivity under the assumption that S is 1/2-convex. Sup- 
pose not, so that there is some essential loop in tti(S) which is trivial in G. After a 
homotopy, we can assume this loop 7 is immersed in Y. Since Y is folded, the image 
of 7 in X is also immersed; i.e. it is represented by a cyclically reduced word in the 
generators. Since by hypothesis 7 is trivial in G, there is a van Kampen diagram 
with 7 as boundary. We may choose 7 and a diagram for which the number of faces 
is minimal. 

The C"(l/6) condition implies that there is a face D in the diagram which has 
at least 1/2 of its boundary as a connected segment on 7. Then the hypothesis 
implies that this segment is actually contained in the boundary of a disk D' of S. 
Since G is C"(l/6) it follows that D' and D bound the same relator in the same 
way, and we can therefore push 7 across D' to obtain a van Kampen diagram with 
fewer faces and with boundary an essential loop in S (homotopic to 7). But this 
contradicts the choice of van Kampen diagram, and this contradiction shows that 
no such essential loop exists; i.e. that tti(S) — > G is injective. 

The proof of quasiconvexity is similar. If S is not quasiconvex, there is a van 
Kampen diagram on an annulus A, with one boundary component a geodesic in G 
and the other a geodesic 7 in Y, in which the ratio of the lengths of the boundary 
components is as big as desired. Since G is C"(l/6) either the annulus has uniform 
thickness (which implies that the ratio of lengths is bounded) or else some face has 
a connected subpath of length at least a of its boundary on 7, for any a < 1/2. 
The argument is then as above. □ 

3. Trivalent fatgraphs 



The purpose of this section is to prove the Thin Fatgraph Theorem 3.3.1 which 
implies that a (homologically trivial) collection of random cyclically reduced words 
bounds a trivalent fatgraph with long edges (i.e. in which every edge is as long as 
desired). 

For concreteness the theorem is stated not for random words but for (sufficiently) 
pseudorandom words, and does not therefore really involve any probability theory. 
However the (obvious) application in this paper is to random words, and words 
obtained from them by simple operations. 

3.1. Partial fatgraphs and tags. We are going to build folded fatgraphs with 
prescribed boundary (i.e. given T we will build Y with T = dS(Y)). In the process 
of building these fatgraphs we deal with intermediate objects that we call partial 
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fatgraphs bounding part of T, and the part of T that is not yet bounded by a partial 
fatgraph is a collection of cyclic words with tags. This language is introduced in [5] . 






Figure 2. Two cyclic words are partially paired along a partial 
fatgraph (the grey tripod); what is left is two cyclic words with 
three tags. 



The (partial) fatgraphs will be built by taking disjoint pairs of segments in T 
with inverse labels (in X) and pairing them — i.e. associating them to opposite 
sides of an edge of the fatgraph. Once all of T is decomposed into such paired 
segments the fatgraph Y will be implicitly defined. 

A partial fatgraph is, abstractly, the data of a pairing of some collection of disjoint 
pairs of segments in T. We imagine that this partial fatgraph Z has boundary 
dS(Z) =: r" which is a subset of I\ The difference T — T' is a collection of paths 
whose endpoints are paired according to how they are paired in T' . The result is 
therefore a collection of cyclic words T" , together with the data of the "germ" of 
the partial fatgraph Z at finitely many points. This extra data we refer to as tags, 
and we call this collection V" a collection of cyclic words with tags. See Figure [2] 
for an example. 

3.2. Pseudorandomness. Random (cyclically reduced) words enjoy many strong 
cquidistribution properties, at a large range of scales. For our purposes it is suffi- 
cient to have "enough" equidistribution at a sufficiently large fixed scale. To quan- 
tify this we describe the condition of pseudorandomness, and observe that random 
words are pseudorandom with high probability. 

Definition 3.2.1. Let T be a cyclically reduced cyclic word in a free group F^ 
with k > 2 generators. We say T is (T, e) -pseudorandom if the following is true: if 
we pick any cyclic conjugate of T, and write it as a product of reduced words Wi of 
length T (and at most one word v of length < T) 

r := WiW2W 3 ■ ■ ■ wpjv 

then for every reduced word a of length T in F^ , there is an estimate 



l-e< 



#{i such that Wi = <j} 

N 



(2k)(2k- 1) 



r-i 



< 1 



Similarly, we say that a collection of reduced words i«j of length T is e-pseudorandom 
if for every reduced word a of length T in Fk the estimate above holds. 
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Lemma 3.2.2 (Random is pseudorandom). Fix T,e > 0. Let T be a random 
cyclically reduced word of length n. Then T is (T, e) -pseudorandom with probability 
l-0(e- Cn ). 

Proof. This is immediate from the Chernoff inequality for finite Markov chains. □ 

3.3. Thin fatgraph theorem. We now come to the main result in this section, 
the Thin Fatgraph theorem. This says that any (sufficiently) pseudorandom ho- 
mologically trivial collection of tagged loops, with sufficiently few and well-spaced 
tags, bounds a trivalent fatgraph with every edge as long as desired. Note that 
every trivalent graph (with reduced boundary) is automatically folded. 

This theorem can be compared with [5], Thm. 8.9 which says that random ho- 
mologically trivial words bound 4-valent folded fatgraphs, with high probability; 
and [3J, Thm. 4.1 which says that random homologically trivial words of length n 
bound (not necessarily folded) fatgraphs whose average valence is arbitrarily close 
to 3, and whose average edge length is as close to log(n)/21og(2fc — 1) as desired 
(and moreover this quantity is sharp). It would be very interesting to prove (or 
disprove) that random homologically trivial words bound (with high probability) 
trivalent fatgraphs in which every edge has length 0(log(n)), but this seems to 
require new ideas. 

Theorem 3.3.1 (Thin Fatgraph). For all L > 0, for any T » L and any < 

e <C 1/T, there is an N depending only on L so that if T is a homologically trivial 
collection of tagged loops such that for each loop 7 in T: 

(1) no two tags in 7 are closer than AL; 

(2) the density of the tags in 7 is at most e/L; 

(3) 7 is (T, e) -pseudorandom; 

then there exists a trivalent fatgraph Y with every edge of length at least L so that 
dS(Y) is equal to N disjoint copies ofT. 

The notation T ^$> L means "for all T sufficiently long depending on L" , and 
similarly < e <C 1/T means "for all e sufficiently small depending on T". The role 
of N will become apparent at the last step, where some combinatorial condition can 
be solved more easily over the rationals than over the integers (so that one needs 
to take a multiple of the original chain in order to clear denominators). In fact, in 
rank 2 we can actually take N — 1, and in higher rank we can take N — 2QL (it 
is probably true that one can take N = 1 always, but this is superfluous for our 
purposes) . 

Except for the last step (which it must be admitted is quite substantial and takes 
up almost half the paper) , the argument is very close to that in [5] . For the sake 
of completeness we reproduce that argument here, explaining how to modify it to 
control the edge lengths and valence of the fatgraph. 

3.4. Experimental results. Theorem |3.3.1| asserts that long random words bound 
trivalent fatgraphs (up to taking sufficiently many disjoint copies). However, in 
order for the pseudorandomness to hold at scales required by the argument, it is 
necessary to consider random words of enormous length; i.e. on the order of a 
googol or more. On the other hand, experiments show that even words of modest 
length bound trivalent fatgraphs with high probability. To keep our experiment 
simple, we considered only the condition of bounding a trivalent graph, ignoring 
the question of whether the edges can all be chosen to be long. 
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Figure 3. The experimental log- fraction of random words of each 
length which bound a trivalent fatgraph. 

In a free group of rank 3, we looked at between 100000 and 400000 cyclically 
reduced homologically trivial words of each even length from 10 to 120. The pro- 
portion of such words that bound trivalent fatgraphs is plotted in Figure [3j The 
vertical axis has a log-scale to show some interesting features of the data. As one 
can see, bounding a trivalent fatgraph happens in practice for n far below the 
purview of Theorem |3.3.1| 

3.5. Proof of the thin fatgraph theorem. We now give the proof of Theo- 
rem HOT] The proof proceeds in several steps. The first few steps are more prob- 
abilistic in nature. The last step is more combinatorial and quite intricate, and is 
deferred to § |4j 

Pick a 7 in r. Choose any location in 7 and write 

7 = W1W2 ■ ■ • Wj^V 

where each \wi\ — T and N = \_\j\/T\. Since 7 is (T, e)-pseudorandom, the Wi are 
very well equidistributed among the reduced words of length T. Moreover, since the 
density of tags is sufficiently low, very few Wi contain a tag. We restrict attention 
to the Wi that do not contain a tag. 

3.5.1. Tall poppies. Throughout the remainder of the proof we fix some X" which 
is an odd multiple of lOi with 1000L < T - T < 2000L. Note that we still have 
T" ^> L. For each wi we let Vi be the initial subword of length T". Note that the 
map which takes a reduced word of length T to its prefix of length T" takes the 
uniform measure to a multiple of the uniform measure, and therefore the Vi are also 
e- pseudorandom. 

The first step is to create a collection of tall poppies. We fix some v := u, 
and read the letters one by one. As we read along, we look for a pair of inverse 
subwords x, X each of length 10L and separated by a subsegment y of length 40L. 
Further we require that the copy of xyX should have the property that the x and X 
are maximal inverse subwords at their given locations, so that the result of pairing 
creates reduced tagged cyclic words. We create some partial fatgraph by identifying 
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X to X; this creates a tall poppy whose stem is x, and whose flower is y. Once we 
find and create a tall poppy, we look for each subsequent tall poppy at successive 
locations along v subject to the constraint that adjacent tall poppies are separated 
by subwords whose length is an even multiple of 10L. Furthermore, we insist that 
the first tall poppy occurs at distance an even multiple of 10L from the start of v. 
See Figure [3] for an example; the "dots" in the figure indicate units of 10L. 



$ 99 9 9 2 



Figure 4. A word v of length 630L with 6 tall poppies folded off 



For each Vi we fold off tall poppies as above. The result of this step is to 
create a partial fatgraph for each 7 consisting of some tagged loop 7' (which is 
obtained from 7 by cutting out all the xyX subwords and identifying endpoints) 
and a reservoir of flowers. Observe that every tagged cyclic word of length 40L 
occurs as a flower, and the set of tagged flowers is e-pseudorandom (conditioned 
on any compatible label on the tag). Informally, we say that the reservoir contains 
an almost equidistributed collection of tagged cyclic words of length 40L. We can 
estimate the total number of flowers of each kind: at each location that a flower 
might occur, we require two subwords of length 10L to be inverse, which will happen 
with probability (2fc— 1) _10L . The number of locations is roughly of size O(|7|/10L). 
So the number of copies of each tagged loop in the reservoir is of size S ■ \j\ (up to 
multiplicative error 1 ± e) for some specific positive 5 > depending only on L. 

3.5.2. Random cancellation. After cutting off tall poppies, the Vi become tagged 
words v[. Observe that the v[ have variable lengths (differing from T" by an even 
multiple of 10L) and have tags occurring at some subset of the points an even 
multiple of 10L from the start. The main observation to make is that the e- 
pseudorandomness of the Vi propagates to e-pseudorandomness of the v[. That 
is, if a is a reduced word of length T' — mlOL > for some even m, then 
among the v[ of length T' — mlOL, the proportion that are equal to a is equal 
to l/(2k)(2k - l) 7 '-™ 101 - 1 up to a multiplicative error of size lie. This is 
immediate from the construction. 

Recall that we chose T" to be an odd multiple of 10L. This means that when we 
pair a segment v\ labeled a with a v'j labeled c -1 the tags of v[ and v'j do not match 
up, and in fact any two tags are no closer than distance L. In fact, it is important 
that after pairing up inverse segments, the tagged loops that remain are reduced, 
so we write each v\ in the form liv"ri where each of li,Ti has length 5L, and pair 
v" with v'J for some v'j of the form Ijv'Jrj where v'j = {v")~ l , and the words Ufj 
and rjj are reduced. By e-pseudorandomness, we can find such pairings of all but 
0(e) of the v[ in this way. 

Thus the result of this pairing is to produce a trivalent partial fatgraph with all 
edges of length at least 10L. Removing this from the v[ produces a collection of 
tagged loops 7" with |t"| = 0(e • I7Q. 
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3.5.3. Cancelling T" from the reservoir. Let T" be the union of all the 7", and pool 
the reservoirs from each 7 into a single reservoir. 

Notice that by construction, no tagged loop in T" has two tags closer than 
distance 20L. It is clear that for each tagged loop v in T" we can build a copy 
of v out of finitely many flowers in the reservoir in such a way that the result 
of pairing v to this v~ l is a trivalent partial fatgraph with all edges of length at 
least L, and the number of flowers that we need is proportional to 2\v\/ L. There 
is a slight subtlety here, in that the length of each flower is 40L, and the result of 
partially gluing up a collection of cyclic words of even length always leaves an even 
number of letters unglued. Fortunately, the assumption that T is homologically 
trivial implies that |T| itself is even, and since each flower also has an even number 
of letters, it follows that |r"| is even. A flower with the cyclic word xyzY can be 
partially glued to produce two tagged loops x and z, and if x and z are odd, each 
can be used to contribute to a copy of some V 1 of odd total length. Since the 
number of odd \v\ is even, all of Y" can be cancelled in this way. 

Since |r"| = 0(e ■ \T\) whereas the number of flowers of each kind in the reservoir 
is of order 5 ■ |T|, if we take edJwe can glue up all of T" this way, at the cost of 
slightly adjusting the proportion of each kind of tagged loop in the reservoir. 

3.5.4. Gluing up the reservoir. We are now left with an almost equidistributed 
collection of tagged loops of length 40L in the reservoir. Adding to the reservoir the 
contribution from each 7 in T, and using the fact that T was homologically trivial, 
we see that the content of the reservoir is also homologically trivial. It remains to 
show that any such collection can be glued up to build a trivalent partial fatgraph 
with all edges of length at least L. 

In fact, we only need two kinds of gluings to achieve this: gluings that result 
in partial fatgraphs that fatten to annuli and to pants. The argument is purely 
combinatorial, but quite intricate and involved, and takes up the content of § 111 

4. ANNULUS MOVES AND PANTS MOVES 

In this section we show that an almost equidistributed collection of tagged loops 
of length 40L can be glued up to a trivalent partial fatgraph with all edges of 



length at least L. Together with the content of § 3.5 this will conclude the proof 
of Theorem [3~3~T1 

Remark 4.0.1. For the entirety of this section, we will rescale 40L to L. That is, we 
prove that an almost equidistributed collection of tagged loops of length L, where 
L is divisible by 4, can be glued up to a trivalent partial fatgraph with all edges of 
length at least L/4. This rescaling is intended to remove meaningless factors of 40 
throughout the argument. 

4.1. Pants and annuli. Let S(L) be the set of tagged loops of length L, where 
L is divisible by 4. Let W(L) be the vector space over Q spanned by S(L); that 
is, W{L) = Q[S(L)]. We define h : W(L) -> Z x Z to be the linear map so that 
h(v) is the homology class of v. Finally, V(L) = kevh C W(L) is the vector space 
of homological trivial vectors. We are interested only in V(L), not W(L), so by 
"full dimensional", we mean a full dimensional subset of V(L). When we say that 
a vector projectively bounds a fatgraph, we mean that there is some multiple of the 
vector which has integer coordinates, and the collection of loops represented by the 
integral vector bounds a fatgraph. A (necessarily integral) vector bounds a fatgraph 
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if the collection of loops that it represents bounds a fatgraph. The uniform vector 
of all l's will be of particular interest, and we denote it by 1. 

We say that a fatgraph Y with boundary a collection of loops in S(L) is thin 
if Y is trivalent and the trivalent vertices of Y are pairwise distance at least L/A 
apart, where the tags are counted as trivalent vertices. Let C(L) be the subset of 
V(L) of positive vectors which projectively bound a thin fatgraph. If v,w E C(L), 
then the disjoint union of the thin fatgraphs for v and w gives a thin fatgraph for 
v + w. Also, the definition of C(L) shows it to be closed under scalar multiplication. 
Hence, C(L) is a cone. A variant of the scallop [4 algorithm gives an explicit 
hyperplane description of C(L), and shows that it is a finite sided polyhedral cone, 
but we won't need this fact in the sequel. 

We will build thin fatgraphs out of two kinds of pieces: (good pairs of) pants 
and (good) annuli (the terminology is supposed to suggest an affinity with the 
Kahn-Markovic proof of the Ehrenpreis conjecture, but one should not make too 
much of this) . A good pair of pants is one whose edge lengths are all exactly L/2 
and whose tags are each on different edges and exactly distance L/A from the real 
trivalent vertices. Note the boundary of each such pair of pants lies in V(L). A 
good annulus is a fatgraph annulus with boundary in S(L) whose tags are distance 
at least L/A apart. Hereafter, all pants and annuli are good. 

Define an involution i : S(L) — > S(L) which takes each loop to its inverse with 
the tag moved to the diametrically opposite position. There are several options for 
the tag at each position - for the definition of t, we arbitrarily choose any pairing 
of the options to obtain an involution. There is a special class of annuli, which we 
call t-annuli, which have boundary of the form s + t(s). Notice that the collection 
of all t-annuli is a thin fatgraph which bounds the uniform vector 1. 

The bulk of our upcoming work lies in manipulating untagged loops, and our 
result here is independently interesting, so we will need some complementary defini- 
tions. Let S'{L) be the set of untagged loops of length L, let W'(L) = Q[S'(L)], and 
let V'(L) be the vector space of homologically trivial vectors in W'(L). We define a 
thin fatgraph and the uniform vector 1' 6 V'(L) as before. The set C'(L) C V'(L) 
is the cone of vectors in V'(L) which projectively bound thin fatgraphs. An un- 
tagged good pair of pants is a trivalent pair of pants whose edge lengths are exactly 
L/2, and an untagged annulus is simply an annulus whose boundary is two loops 
of length L. For untagged loops, i : S' — > <S" is simply inversion, and all annuli are 
i-annuli, although we may refer to them explicitly as i-annuli to emphasize their 
purpose. 

For many applications, the property of a collection of loops that it projectively 
bounds a thin fatgraph is good enough (see e.g. [5]), and this is in many ways a more 
pleasant property to work with, since the set of vectors (representing collections of 
loops) which projectively bound a thin fatgraph is a cone, whereas the set of vectors 
that bound (i.e. without resorting to taking multiples) is the intersection of this 
cone with an integer lattice. However, in this paper it is important to distinguish 
between "bounding" and "projectively bounding" , and therefore in the following 
propositions, we give both the stronger, technical "integral" statement and the 
weaker, cleaner "rational" one. 

Proposition 4.1.1. For any integral vector v € V'(L), there is n E N so that 
(L/2)v + nl' bounds a collection of good pants and annuli. Consequently, C'{L) is 
full dimensional and contains an open projective neighborhood of 1' . 
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There is a stronger version without the L/2 factor if the free group has rank 2. 

Proposition 4.1.2. If the free group has rank 2, then for any integral vector v € 
V'(L), there is n € N so that v + nl' bounds a collection of good pants and annuli. 



We believe that Proposition 4.1.2 is probably true for higher rank, but the proof 
would be more complicated than we wish for a detail that we do not need. 

We delay the rather tedious proof of Proposition |4.1.1| in favor of stating the 
tagged version, which is a corollary and is the version we need. 

Proposition 4.1.3. For any integral vector v E V(L), there is n € N so that 
(L/2)v + nl bounds a collection of good pants and annuli. Consequently, C(L) is 
full dimensional and contains an open projective neighborhood of 1 . 

Proof. Let us be given an integral vector v £ V(L). Define / : V(L) —>■ V'(L) 
to be the map S(L) —> S'(L) which forgets the tag, extended by linearity. By 
Proposition |4.1.1| we can find a collection of pants and annuli which has boundary 
(L/2)f(v) + ml'. Call this fatgraph Y'. Now place arbitrary tags in the forced 
positions on the pants (in the middle of the edges) and in allowed positions on the 
annuli (at least L/A apart) to obtain Y. Clearly, Y is a thin fatgraph, and Y almost 
has boundary (L/2)v + ml, as desired, but the tags are in the wrong places. We 
fix this by simply adding annuli which "twist" the tags into the right positions. 

For each boundary of Y corresponding to a loop 7 in (L/2)v, we can add at most 
two annuli to Y to resolve the boundary into 7 + a + i(a), where a is one or two 
tagged loops; see Figure [5] The remaining boundary of Y consists of a collection of 
inverse pairs in V'(L), but is now incorrectly tagged, and so does not decompose 
into l pairs. Choose a "correct" desired tag arbitrarily for each boundary pair so 
that they constitute an 1 orbit 7 and 1(7). In the same way as we did for the loops 
in (L/2)v, we can add (at most 4) annuli to resolve the boundary into the correct 
7 + 1(7) + a + i(a). The boundary of Y now decomposes into (L/2)v + w + l(w) 
for some integral w £ V(L). By adding t-annuli, we can make the boundary of Y 
be (L/2)v + nl, for some n > m, as desired. 





Figure 5. Adding annuli to fix incorrectly tagged boundary. We 
desire the tagged boundary 71 + 72 + 73- Applying Prop. 4.1.1 



we find the pair of pants, left, with the correct boundary, but tags 
in the wrong places. Adding a few annuli produces the correct, 
tagged boundary, plus a collection of 1 orbits ojj + t(a,). 



□ 



It remains to prove Proposition |4. 1 ,T| which we now do. 
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4.2. Proof of Proposition 4.1.1 Let us be given an integral vector v' £ V'(L). 
The vector v' represents a collection of tagged loops s' € S'(L), for which we must 
find a thin fatgraph Y of the desired form. Our goal is to build a fatgraph which 
bounds s' + t + t(t) for some t £ S'(L). Adding t-annuli will then immediately finish 
the construction. 

If we have a collection of annuli and pants which has boundary s' + t, then the 
problem reduces to finding a collection of annuli and pants which has boundary 
of the form i(t) + u + l(u) for some u € S'(L). We'll repeatedly apply this idea 
to simplify the problem by attaching pants. If we want to have boundary which 
contains a loop 7, and we find a pair of pants with boundary 7 + a + a', then now 
we need only find boundary containing i(a) + l{ol'). In this case, we'll say that 7 
and t{a) + t{ct') are pants equivalent. 

For this entire section, we will assume that our free group has rank 2 and is 
generated by a and b. In § |4.3| we explain the extra details required to deal with 
free groups of higher rank. 

Our strategy will be to start with s' and attach (many) pairs of pants which put 
all the loops in s' into a nice form. Then we attach more pants to further simplify 
the loops, and so on, eventually reducing to a case that is simple enough to handle 
by hand. A run in a loop is a maximal subword of the form aP or if for some 
integer power p ^ 0. Note that any loop contains an even number of runs. A loop 
has balanced a runs, respectively b runs, if all the runs of a's, respectively &'s, have 
the same length. A loop has balanced runs if it has balanced a runs and balanced b 
runs. We first show how to attach pants to reduce any loop to a collection of loops 
with at most 4 runs. Then we attach pants to produce loops with balanced runs, 
then loops with 2 runs, then we simplify this collection into a standard form. 

For clarity, we separate the simplification into lemmas. 

Lemma 4.2.1. Any loop is pants- equivalent to a loop with 4 runs. 

Proof. To begin, we show how to attach a pair of pants to a loop which produces 
two loops, each of which has fewer runs than the initial loop. This method works 
whenever the number of runs is more than 4, so it reduces the loops in s' to a 
collection of loops with at most 4 runs. Let us be given a loop 7. The easiest way 
to visualize attaching a pair of pants is to simply draw a diameter d on 7 between 
two antipodal vertices. Labeling the diameter d produces a pair of pants attached 
to 7. Note that we must be careful to label d compatibly with the labels adjacent 
to the vertices to which we attach d, so that the vertices do not fold. See Figure [6] 






Figure 6. Attaching a diameter to a loop to form a pair of pants. 
The pair of loops on the right is pants equivalent to the loop on 
the left. 
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For concreteness, let us number the vertices of 7 by 0, . . . , L — 1, and we 
let d t be the (oriented) diameter with initial vertex i (and thus terminal vertex 
(i + L/2) modi). Each diameter di divides 7 into two pieces. Let Xi be the 
number of runs in the non-cyclic subword of 7 starting at index i and of length 
L/2; that is, the number of runs in the word to the "right" of di. Similarly, let yi 
be the number of runs to the left. See Figure [JJ Let r be the number of runs in 
7. Note that Xi + yi may be greater than r. Specifically, r < Xi + yi < r + 2. The 
important feature of these numbers is that \xi — Xi+\ | < 1 and \yi — yi+\ | < 1. This 
is easily seen by considering the combinatorial possibilities that occur as we rotate 
the starting point i around the loop 7. See Figure [71 We are particularly interested 
in matched runs, which are runs separated in either direction by the same number 
of other runs. That is, matched runs are "directly across" from one another in the 
list of runs (we use scare quotes to emphasize that matched runs are not antipodal 
in the same sense that "antipodal vertices" are). 



A 



Figure 7. Moving the diameter one position changes Xi and yi by 
at most 1. For the marked diameter, we have Xi = 5 and yi = 4. 
A pair of matched runs is marked in grey. 

If we connect the dots on the graphs of the functions Xi and yi, then by the 
intermediate value theorem, there is some point at which the graphs of Xi and yi 
intersect. This can happen either at some value of i, in which case Xi = yi, or 
between two values i and i + 1, but by the above fact, in this latter case \xi — 
y.i\ < 1 and \xi + i — yt+i\ < 1. In either case, di must intersect two matched runs 
R and R' , perhaps on the boundaries of the runs. Now decrease i until one of 
the ends of di lies on the boundary of R or R' . This puts di in to one of two 
combinatorial configurations, up to rotation and symmetry. See Figure [H] Note 
that the configuration on the right cannot occur at the intersection point, since 
\Xi -yi\ = 2. 

We handle the two cases separately. First, the more generic case illustrated in 
Figure[8]on the left. Here we label the diameter entirely with the generator which is 
not the one labeling the bottom run, and in such a way as to minimize the number 
of resulting runs. Figure |9j left, illustrates this. The sign of the labels on the 
diameter depends on the signs and orders of the generators around the endpoints, 
but the picture is equivalent. Notice that the number of runs in each of the resulting 
loops is at most r/2 + 2. 

In the non-generic case illustrated Figure [8] in the middle, we label half of the 
diameter with one generator and the other half with the other, in a way which is 
compatible with the top and bottom labels. See Figure [9j right. In certain cases, 
it is possible to label the entire diameter with a single generator, and this reduces 
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Figure 8. Possible configurations of d, with respect to the 
matched runs R and R' . Up to rotation and symmetry, there are 
two. Note the configuration on the right cannot occur. 





Figure 9. Labeling the diameter to reduce the number of runs. 
Each label represents a potentially long run of that generator. 



the number of runs still further, but we have illustrated the worst situation. We 
therefore compute again that the number of runs in each of the resulting loops is 
r/2 + 2. 

As long as r/2 + 2 < r, or r > 4, this will produce two loops of strictly smaller 
length. Repeatedly attaching pants resolves our collection s' into a new collection 
of loops, all of which have at most 4 runs. □ 

Lemma 4.2.2. Any loop with 4 runs is pants equivalent to a collection of loops 
with at most 2 runs. 

Proof. This is the most tedious section of the proof, and it relies on balancing out 
the 4 runs. There are two basic moves that we will use, a triangle move, and 
a division move. These are illustrated in Figure 10 The division move is self- 



explanatory. A correct way to think of the triangle move is to imagine rotating 
around the labels on the cut corner to label the new diameter. 

Algebraically, a triangle move takes in a word a ei b e2 a e3 b ei (for clarity, assume 
all the exponents are positive) such that e2 + e% > L/2 and e2 < L/2 and builds a 
pair of pants with boundary 

a ei b e2 a ea b ei + A ei+L / 2 ~ e2 B ei A e3 ~( L / 2 ~ e2 ^B e ' 2 + a L / 2 ~ e2 b e2 A L / 2 ~~ e2 B e2 

The notation obscures the function of a triangle move, which is as follows: it 
produces one loop which is balanced, and another loop with the same run sizes as 
the original one, except that L/2 — e^ of the a's in the top run have been shifted 
down to the bottom run. The signs of the generators may change as they shift, but 
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Figure 10. Using triangle and division moves to rearrange the 
runs in a loop. Lengths aren't to scale, except where marked to 
indicate runs of the same length 



we are only concerned with the lengths at this point. This is the critical feature 
of the triangle moves — shifting generators from one run to the other without 
disturbing anything else. 

Our goal will be to rearrange our 4-run loop into a collection of loops, each with 
a diameter between corners; that is, a diameter which begins at the junction of two 
runs and ends at the junction of the other two runs. Given a loop with a diameter 
between corners, there are two possibilities. Either no inverse pairs of generators 
appear, or there is at least one inverse pair. In the former case, a single triangle 
move produces a 2-run loop, and a loop with an inverse pair. In the latter, we can 
label the diameter with a single generator to produce two 2-run loops. Incidentally, 
performing this operation on a commutator produces two inverse loops. The two 



possibilities are shown In Figure 11 



Of course, not every 4-run loop has a diameter between corners. However, there 
is a sequence of moves that produces a collection of loops with diameters between 
corners. There are three possible inputs to this sequence, illustrated in Figure [12} 

(1) an equidistributed loop; 

(2) a half- equidistributed loop; or 

(3) a loop which has two adjacent runs of lengths L/A and L/2, respectively. 
An equidistributed loop is a loop which has two diameters at right angles (with 
endpoints equally spaced) such that one diameter begins on one b run and ends 
on the other, and the other diameter begins on one a run and ends on the other. 
It is acceptable for the diameters to begin or end on a vertex between runs. A 
half-equidistributed loop is a loop for which one of the runs has length L/2, and 
both runs of the other generator have length at most L/A. 

The sequence of moves which takes any of these three types of loops and produces 



loops with diameters between corners is shown in Figure 13 The output, at bottom 
right, is a perfectly balanced loop, which trivially has a diameter between corners, 
and the loop with the diameter shown. Although it is not to scale, this latter loop 
does have a diameter between corners, since it is the product of a triangle move on 
the loop to the left, and thus has two adjacent runs, both of length L/A. 
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Figure 11. Performing triangle moves on loops with diameters 
between corners in order to produce 2-run loops. There are two 
possibilities, depending on whether an inverse pair appears. 




Figure 12. Two loops which are both equidistributed (left), a 
loop which is half-equidistributcd, and a loop with adjacent runs 
of lengths L/2 and L/4. 



So producing 2-run loops reduces to showing that we can make any loop equidis- 
tributed or half-equidistributed. For this, we'll use triangle moves to shift runs to 
appropriate locations. 

Before moving on to the next step of the argument, we remark that the triangle 
and division moves can be reinterpreted geometrically in the following way. Given 
a 4-run loop, let us rescale so the loop has length 1. The run lengths are now real 
numbers which sum to 1, so the space of such loops is 3-dimensional. Triangle and 
division moves are piecewise linear transformations, where the pieces are given by 
linear inequalities. The set of equidistributed loops is a polyhedron, so this lemma 
shows that by repeated application of the triangle and division piecewise linear 
maps, the entire polyhedron of 4-run loops can be brought inside the equidistributed 
polyhedron. Figure [T4| shows the equidistributed polyhedron; it is a skew-projection 
of a 4-dimensional cube (i.e. a zonohedron). 

We now return to the proof. Suppose we are given a 4-run loop. Without loss of 
generality, let us suppose there are at least as many a's as o's, and let G = #a — #6 
be the generator inequity, recording how many more a's than 6's there are. Let x 
and x' be the number of 6's in the longer and shorter b runs, respectively. Abusing 
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Figure 13. Quartering an equidistributed loop to produce ex- 
actly balanced loops and loops with a diameter between corners. 
Lengths are not to scale, and some lengths are labeled. The op- 
eration is read left to right, top to bottom. Notice that along the 
way, we handle the case of a half-cquidistributed loop and a loop 
with adjacent runs of lengths L/2 and L/4. 




Figure 14. The polyhedron of equidistributed 4-run loops. 



notation, we'll also refer to the runs themselves as x and x' . Note that the a or 
b runs may have negative exponents, so they are actually runs of A or B. For 
simplicity, we'll use the "positive" notation. 

First, let us eliminate the degenerate case that there are very few 6's. Suppose 
that x + 2x' < L/2. Consider the two diameters starting at the ends of x. There 
are two cases: if x = x' and the runs are exactly antipodal, then both diameters 
run between corners, which is a case we have already dealt with. Otherwise, one 
of the diameters misses x' entirely, and we can cut to produce an even loop and a 
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loop whose generator inequity is strictly smaller (the roles of a and b are reversed, 
and the inequity becomes 2x' < 2(L/2 — x — x') = G, i.e. smaller than the current 



inequity). See Figure 15 





Figure 15. If the b runs are short, 
generator inequity. 



we can cut to reduce the 



Therefore, we may assume that x + 2x' > L/2, and by definition, x > x' . We 
conclude that x > L/6 and x' > L/4 — x/2, and therefore that x + x' > L/3. It 
follows that G < L/3. Define R = x — x'. As a warm-up, let us first assume that 
x < L/A. Since x, x' < L/4, if we can find a diameter between x and x' , we are 
done, for the diameter at right-angles will always be admissible, and the loop will 
be equidistributed. 



target range 




Figure 16. The picture showing the offset P that we'd like to 
minimize and the target range; we succeed in making the loop 
equidistributed or half-equidistributed if we shift the endpoint of 
x into the target range, as it is in the picture. 



We always read loops counterclockwise. Let d be the diameter starting at the 
end of the x' run. Though we won't explicitly use it, it is useful to consider the 
quantity P, which is the distance between the endpoint of d and the end of x. 
Figure [l6| shows the picture. Notice that if the endpoint of x lies anywhere in the 
(closed) interval indicated, then there is a diameter between x and x'. We'd like to 
make the magnitude of P small. 

Recall that when performing a triangle move at x, we shift L/2 — x of the a's 
from one of the a runs to the other. We compute this in terms of our variables. 
Note the total number of b's is (L — G)/2, so 
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Therefore, if we denote the number of a's which are shifted on every triangle move 
at x by s, we have 

L L G R L R 

s = x = 1 < , 

2 4 4 2 ~ 3 2' 

where we have used the fact that G < L/3. It is better if s is smaller, because then 

we can be more accurate with the placement of x. The size of the target region for 

the endpoint of x is x + x' (x + x' + 1 vertices), so if s < x + x' , then the shift size 

is small enough that we can place the endpoint of x somewhere in the region. But 

we know that 

L R L 
S <3-2<3<,W, 

so the x run can be placed as desired to make the loop equidistributed. 

Recall that this was the restricted case that x < L/4, which actually turns out to 
be the hardest. To complete the argument, we must handle the case that x > L/A. 
The issue is no longer that we can make a diameter from x' hit x, but that x might 
overlap the other diameter which lies at a right angle to the first. This problem is 



illustrated in Figure 17 




target range 



Figure 17. If a; is large, we have to make sure it doesn't cross the 
diameter at a right angle to the diameter from x' . 

Let us compute the size of the target range for the endpoint of x. As one can see, 
the target range has size 2(L/4) + x' — x = L/2 + x' — x. But recall that the amount 
of a's we shift between a runs is the shift size s = L/2 — x, so we immediately 
have s < L/2 + x' — x; i.e. the shift size is smaller than the target range, so we 
can succeed in placing the endpoint of x in the desired range. This completes the 
argument showing how to make any 4-run loop equidistributed. 

Since we have shown how to make any 4-run loop equidistributed, and any 
equidistributed loop into 2-run loops, we have reduced ourselves to a collection of 
loops, each of which has at most 2 runs, as desired. □ 

For the next step, we will modify our collection of loops with 2 runs to put them 
in a standard form. A uniform loop has a single run, so just one generator appears. 
An even loop has two runs of the same length (so length L/2). The type of a loop 
with two runs is a pair that records which generators appear, so for example (a, B). 

Lemma 4.2.3. Any collection of loops with 2 runs is pants equivalent to a collection 
of uniform loops, even loops, and at most one loop of each type. 

Proof. This step involves shifting and combining loops into even and uniform loops, 
which will leave a finite remainder. All of these operators are performed on loops 
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of the same type. The first step is to arbitrarily select a generator to be the small 
generator on each loop. We'll choose b. Any loop whose b run has length over L/2 
can be cut with a diameter labeled with just a to produce an even loop and a loop 



with b run length less than L/2, as shown in Figure 18 An important feature of 




Figure 18. If a loop has a b run length larger than L/2, it can 
be cut to produce an even loop and a loop with a b run shorter 
than L/2. See the text for a discussion of why the inner boundary 
components become inverse on the right. 



cutting to reduce the size of the b run is that it doesn't change the type of the 
loop. We have been somewhat casual about this thus far, because it makes the 
pictures easier to understand, but recall from the introduction to this section that 
if we have a loop 7, and we find a pair of pants with boundary 7 + a + a', then the 
remaining problem is to find a collection of pants with boundary t(a) + o(a'); that 
is, recall that 7 is not pants equivalent to a + a', but rather is pants equivalent to 
i(a) + t(a'). Therefore, as shown in Figure 18 the input is the loop of type (a, 6), 
and the output is an even loop and another loop of type (a,b). The same holds 
true for the other operations we describe here. 

We aren't concerned with the even loops, so we turn our attention to the loops 
with b run length strictly less than L/2. There are two necessary operations here; 
the trade, in which we swap pieces of the b run between loops, and the combine, in 
which we combine two small b loops into a single one (and produce a uniform loop 
and two even loops as byproducts). The combine operation works on any two loops 
whose total number of 6's is strictly less than L/2. These operations are shown 
in Figures 19 and 20 Algebraically, the trade operation takes in a Pl b tl+t2 and 



a P2 b Wl+W2 , where h,t 2 ,w 2 > and twi > 0, and produces a Pl b tl+W2 and a P2 b Wl+t2 . 
The combine operator takes in a Pl b Tl and a P2 b r2 , where ri + r 2 < L/2, and produces 
a P3 b ri+r2 . 

Notice that the trade operation requires only that one of the diameters be interior 
in the b run, or as written above, w± > but w 2 > 0. Using the trade operation, we 
can take every b run and, for each one, move all but a single b onto a single chosen 
loop. Whenever this chosen loop contains a b run longer than L/2, we cut it off 
and start trading again. After this, we are left with a single loop with an unknown 
length b run, and possibly many loops with a single b. Then, we use the combine 
operation to combine these loops into a smaller number of loops with longer b runs. 
Then we trade the b mass to our chosen loop again, and so on. 

After trading and combining as much as possible, we are left with either no loop 
(if the only remaining loop is even), a single loop with a b run of length less than 
L/2, or two loops whose combined run length is exactly L/2. This last case arises 



22 



DANNY CALEGARI AND ALDEN WALKER 

b h b b 

A, 




Figure 19. Trading pieces of b runs between loops 





Figure 20. Combining b runs onto a single loop. The first step 
produces a byproduct uniform a loop, the second an even loop, 
and the third another even loop. 



because we cannot use the combine operation on these loops. There is yet another 
sequence of moves, however, to resolve this: we use the trade operation to obtain 
two loops with b runs of length exactly L/A. Then we cut and join them to produce 
a perfectly balanced loop with 4 runs. A single triangle move results in an even 
loop plus a commutator; we then apply Figure [TT] to the commutator to get a pair 
of inverse loops. 

Doing this to each loop type proves the lemma. □ 



The final step in the proof of Proposition 4.1. 1| is to show that we can attach 
pants and annuli to the output of Lemma 4.2.3 that is, uniform loops, even loops, 
and a single remainder loop of each type, so that we have nothing left. Let x a ,b, 
x a .B, %A,b, and xa,b denote the run length of the single b run in the remainder loop 
of each type. Rescaling and considering arbitrary L, we can think of each variable 
as a real number in the open interval (0, 1/2). The fact that the entire collection 
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of loops must be homologically trivial gives us two linear equations 

X a ,b + Xa : B ~ X A ,b ~ %A : B = h 
(1 - X a> b) - (1 - X a . tB ) + (1 - X A: b) - (1 - XA,b) = k 2 , 

Since there are even and uniform loops to consider, it is not a priori the case that 
fci = &2 = 0. However, it is the case that /ci,fc2 S |Z, and k\ — k\ € Z, since the 
uniform and even loops change homology discretely. Solving the linear equations 
shows that the only solutions to this linear system in (0, 1/2) exist when fei = k% = 
0, and in fact it must be that x a j, = xa,b & n d a; a ,s = XA,b- Since these variables 
are equal, we can use annuli to glue the remainder loops. The homologically trivial 
collection of uniform and even loops can now be glued along entire edges, so is 
obviously pants equivalent to the empty collection. That is, we have successfully 
produced a collection of pants and annuli with boundary s' + t + i(t), where s' is our 
original collection, and t is the many intermediate boundaries we used to reduce s' 
to the empty set. Adding in t-annuli, then, gives us a collection of pants and annuli 
which has boundary s' + nl', for some sufficiently large n. 

Observe that we never need to duplicate our collection s', or, equivalently, mul- 



tiply v' by any factor. We have therefore proved the stronger Proposition 4.1.2 for 
rank 2 free groups. 

4.3. Higher rank. We have completed the proof of Proposition |4. 1 . l] in the case 
that the free group has rank 2. We now describe the necessary modifications to the 
argument for higher rank free groups. Given a collection of loops s' £ S'(L), the 
same technique of cutting with diameters works to show that s' is pants equivalent 
to a collection of 4-run loops. However, triangle moves no longer apply, since each 
of the 4 runs might be a run of a different generator. 

The first step is to attach pants in such a way that we are left with 4-run loops, 
each of which only involves two generators. This is actually quite straightforward, 
since we have more freedom with the labels on the diameters that we attach. Fig- 
ure [21] shows how to attach diameters to produce loops of the desired form which 
are pants equivalent to the original loop. Technically, the figures represent simply 
unions of pants, not the (non-trivalent) fatgraphs shown. They are drawn as shown 
to emphasize the point that we produce several other byproduct loops, but they 
come in cancelling inverse pairs. All the interior diameters shown have length L/2, 
even though they are not drawn to scale. 

We remark that the pictures in Figure [21] are general, up to rotation and reflec- 
tion. The double-diameter from top to bottom exists by the argument in the proof 
of Lemma |4.2.1| and the double-diameter from the lower left corner to the middle 
exists because the first diameter has length L/2. We also remark that in higher 
rank, it is possible that we have some 3-run loops. It is simple to cut these to 4-run 
loops and then apply the above argument. 

After applying Lemma |4.2.2[ we may assume that we are left entirely with uni- 
form loops, even loops, and one 2-run loop of each type. Now, though, there are 
4(2) loop types, which is too many to duplicate the linear-algebraic argument from 
the previous section. The simple solution is to take L/2 copies of our collection. 
Now each loop type is repeated exactly L/2 times, so when we re-collect the re- 
mainder, we are left with no remainder, so we have only uniform loops and even 
loops, which can be paired arbitrarily. Therefore, we have found a collection of 
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Figure 21. Reducing each loop to lie in a rank 2 subgroup. The 
fatgraph on the left is built out of folded pants only when b and 
d are distinct generators. If 6 = d , then we use the picture on 
the right. A similar picture holds when the vertical diameter has 
cndpoints on runs of the same generator. The point is that all of 
the non inverse-matched pants boundaries lie in rank 2 subgroups. 

pants and annuli which has boundary (L/2)v r + nl', which completes the proof of 
Proposition |4.1.1| 



5. Random one-relator groups 

Throughout this section we fix a free group Fk with fc > 2 generators, and we 
fix a free generating set. For some big (unspecified) constant n, we let r be a 
random cyclically reduced word of length n, and we consider the one-relator group 
G:=(F\r). _ 

In this section wc will show that with probability going to 1 as n — > oo, the group 
G contains a surface subgroup tti(S). The surface S in question can be built from 
N disks bounded by r and N disks bounded by r , glued up along their boundary 
in such a way that the 1-skeleton is a trivalent fatgraph with every edge of length 
> L, where L is some (arbitrarily big) constant fixed in advance, and N < 20L is 
the constant in the Thin Fatgraph Theorem |3.3.1| 

The group G is evidently C'(X) for any A > with probability going to 1 as 
n — > oo, and therefore the injectivity of S can be verified by showing that the 
1-skeleton of S does not contain a long piece in common with r or r -1 , except for 
a piece contained in the boundary of one of the 2N disks. 

A trivalent graph Y in which every edge has length at least L has at most 
\Y\ ■ 2 m / L subpaths of length m, where \Y\ is the length of Y. In a free group of 
rank fc there are (approximately) (2fc — l) m reduced words of length m, and the 
relator r contains at most n of them. Taking m — e ■ n, a simple counting argument 
shows that Y does not have any path of length m in common with an independent 
random word of length n. However, Y and r are utterly dependent, and we must 
work harder to show that S is injective. 

The key idea is to break up Nr U Nr~ x into pieces (called beads) which each 
bound their own trivalent fatgraph. Then subpaths in the fatgraph associated to 
one bead will be independent of the subpaths of Nr U Nr^ 1 associated to another 
bead, and this argument can be made to work. 



RANDOM GROUPS CONTAIN SURFACE SUBGROUPS 25 

5.1. Independence and correlation. A random word in a finite alphabet (in 
the uniform distribution) has the property that any two disjoint subwords are in- 
dependent. A random reduced word in the free group fails to have this property 
since (for example) if uv are adjacent subwords, the last letter of u must not can- 
cel the first letter of v (so the words are not really independent). However, such 
words have a slightly weaker property which is just as useful as independence in 
most circumstances; this property can be summarized by saying that correlations 
decay exponentially. This means that if u and v are reduced words, and we fix 
some distance T, then the probability of finding some substring v at a specific 
location conditioned on finding u at a location a distance T from it, differs from 
l/(2k)(2k— l)" -1 (i.e. the probability if disjoint subwords were really independent) 
by (2k — 2)~ T . See e.g. [3:, Lem. 2.4 for a careful proof. 

If T ^> \v\ (for instance, if \v\ — 0(log(n)) while T = 0(n e )) then in practice 
one can treat u and v as though they were independent. In the sequel we will say 
informally that correlations (between disjoint subwords) decay exponentially. 

5.2. Beads. Let r be a random cyclically reduced word of length n. We fix some 
(small) positive constants C and 5] later we will say how small they should be. 

Definition 5.2.1. A bead decomposition of r is a decomposition of r into segments 
labeled (in cyclic order) ro, r^ , f^ , • • ■ , ritf— 1> r M > r M-i> ' ' ' > r i where each r i has 
length n 1 - 8 ■ (1/2 + o(l)) and r Q ,r M have length n 1 ' 5 ■ (1 + o(l)) (so that M = 
n s ■ (1 + o(l))) so that the prefix of rf of length Clog(n) is inverse to the suffix of 
rr, and the prefix of Tm of length Clog(n) is inverse to the suffix of tm- 

Given a bead decomposition of r, we glue the mutually inverse subwords de- 
scribed above, thereby decomposing r into a sequence of loops ("beads") of length 
n x ~ s ■ (1 + o(l)) with one or two tags. Denote these beads Bo, B\, • • • , Bm- The 
edges of length Clog(n) obtained by gluing the inverse prefixes/suffixes we refer to 
as the lips of the beads. 

Lemma 5.2.2 (Bead Decomposition). There exists a bead decomposition with prob- 
ability 1 — 0(eT n ). Moreover, the bead decomposition can be chosen so that the 
correlations between distinct Bi decay exponentially. 

Proof. We start at an arbitrary location in r, and take this to be the approximate 
midpoint of r . Extend this region n 1_<5 ■ (1/2 — n~ s ) in either direction, and 
then read the next pair of segments of length n l ~ synchronously until the first 
time we read off a pair of mutually inverse segments of length Clog(n). If C < 
(1 — 2<5)/log(2fc — 1) then such a pair of mutually inverse segments will occur with 
probability 1 — <3(e~" ) by Chernoff's inequality, for some c > depending on S. 
We then build successive segments r i in order by the same procedure. There are 
n such segments, and this polynomial term can be absorbed into the exponential 
estimate of probability at the cost of adjusting constants. To see that correlations 
between different Bi decay exponentially, imagine that we generate the word r by 
a Markov process letter by letter as we go, so that we only generate successive Bi 
after having already constructed Bj with j < i. □ 

There is no reason to expect that the beads Bi are homologically trivial, but 
there is a trick to adjust them so that they are. We build a bead decomposition 
simultaneously, so that the beads of r^ 1 have inverse labels to the 
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beads of r. We denote the beads of r _1 by B~ , and note that they are inverse (as 
tagged cyclic words) to the Bi. Then for each i the union BiUB~ l is homologically 
trivial. 



By Theorem 3.3.1 for each i, and for some N < 20L, the collection NBiUNB~ 
bounds a trivalent fatgr aph Yj with all edges of length at least L, with probability 
1 — 0{er n ). By Lemma 5.2.2 for j ^ i the beads Bj are almost uncorrelated with 



the Yi, so for any e there is c(e) so that with probability 1 — 0(e n ) there are no 
paths in Yi of length n e in common with any segment in r — r i . 

Lemma 5.2.3 (No long piece). Let Y be the fatgraph obtained from the union of the 
Yi associated to a bead decomposition as above. Then with probability 1 — 0(e~" ) 
every path in Y of length e • n which appears in r or r~ x is in dS(Y). 

Proof. Let 7 be a path in Y of length e • n and let 7'cr (without loss of generality) 
have the same labels as 7. Then for any e > there is a c > so that for each i and 
each subsegment a' of 7' of length n e contained in Bi the corresponding subsegment 
a' of 7' must have at least (1 — e) of its length contained in Yi. By the definition of 
the bead decomposition, successive subsegments of 7' in adjacent Bi are joined by 
paths of length C log(n) running over the lip. The corresponding subsegments in 7 
that transition from Yi to Y i+ i must also run over the lip, so there is another copy 
of the word on the lip within n e of the lip. If the two copies are not distinct, so that 
7 and 7' overlap on a common piece, then since Y is folded we must simply have 
7 = 7' and 7 is in dS(Y) as claimed. Otherwise there are two distinct copies of 
the lip contained in a segment of length n e in r. If e is sufficiently small compared 
to C, the probability that two identical subwords of length Clog(n) will occur in a 
specific segment of length n e is arbitrarily small (in fact, of size 0(n~ e ) for some e' 
depending on e and C). The probability of such occurrences at adjacent lips is not 
independent, but the correlations decay exponentially, so since this must happen 
for 0(n s ) successive segments Yi (the segments that 7' intersects in order), the 
probability of all these lips having nearby distinct copies is 0(e~ n ) and the lemma 
is proved. □ 

We deduce the following theorem as a corollary: 

Theorem 5.2.4 (Random One-Relator). Fix a free group F^ and let r be a random 
cyclically reduced word of length n. Then G := (F^ \ r) contains a quasiconvex 
surface subgroup with probability 1 — 0(eT n ). 



Proof. This follows from Lemma|5.2.3|and Lemma 2.2.5 □ 



In fact, there are 0(e n ) many choices of bead decomposition, and most of these 
give rise to quasiconvex surface subgroups. 

Definition 5.2.5. We call the surfaces constructed as above beaded surfaces. 

Note that a beaded surface has genus 0(n). In fact, in the case that k = 2, a 
beaded surface can actually be taken to have genus o(n), since we can take N = 1 
in the application of the Thin Fatgraph Theorem, and then as n — ¥ 00 we can take 
L — > 00. It seems very likely that the surfaces can be taken to have genus o(n) for 
any k. 
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6. Random groups 

In this section we prove our main theorem, that a random group at density 
D < 1/2 contains a surface subgroup with probability 1 — 0(eT n ). In fact, our 
argument shows that it contains many subgroups (of genus 0(n)). Our argument 
depends on some elements of the theory of small cancellation developed for random 
groups by Ollivier [12 , and we refer to that paper several times. 

6.1. Small cancellation in random groups. For later convenience, we here state 
three results from Ollivier [T3] that we use in the sequel. 

Theorem 6.1.1 (Ollivier, |12j . Thm. 2). Let G be a random group at density D. 
Then for any positive e, and any reduced van Kampen diagram T> containing m 
disks, we have 

\dT)\ >{l-2D-e)-nm 
with probability 1 — 0(e~ n ) 

Here the hardest part is to show that the same e works for van Kampen diagrams 
of arbitrary size. 

Theorem 6.1.2 (Ollivier, [12], Cor. 3). Let G be a random group at density D. 
Then the hyperbolicity constant 6 of the presentation satisfies 6 < 4n/(l — 2D) with 
probability 1 — 0(eT n ). 

Theorem 6.1.3 (Ollivier, |12j . Thm. 6). Let G be a random group at density D. 
Then for any positive e, and for any reduced van Kampen diagram T> with at least 
two faces, there are at least two faces which have a (connected) piece on dT> of 
length at least n(l — 5-D/2 — e), with probability 1 — 0(e~ n ). 

The statements of theorems in Ollivier's paper do not make the estimate of 
probability (as a function of n) explicit; however these estimates are straightforward 
to derive from his methods (and in any case, we do not use them in the sequel). 

6.2. Convexity and (m, a)-convexity. 

Definition 6.2.1. Fix a free group Fk with k > 2 and a free generating set. 
A random group of length n and density D < 1 is obtained from Fk by adding 
(2k — l) nD independent random cyclically reduced relations of length n. 

Gromov showed that a random group at density D satisfies the small cancellation 
condition C" '(2D) with probability going to 1 as n — > oo. Pick one relator r, and 
build a beaded surface S as in § [5] whose spine is trivalent and with every edge of 
length > L for some large (fixed) L. 

Lemma 6.2.2. Fix D. Then for any a > D, a beaded surface S constructed by 
the method of § [5^ is a-convex, with probability 1 — 0(e~ n ). 

Proof. By Lemma |5.2.3| for any positive e, the spine Y of S has no piece of length 
e • n in common with the relator r except for pieces occurring in dS(Y). For any 
a > there are 0(2 Q ™/ L ) paths in Y of length an. Define S = log(2)a/L so that 
p Sn _ 2 an / L ) anc j note that by taking L sufficiently large, we can make 8 as small 
as we want. 

There are (2k—l) an reduced words of length an, and a random relator r' contains 
n subwords of this length, so a random relator r' has probability 0( e < 5 ™- 1 °g( 2 ' £ - 1 ) Q ™) 
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of having a piece of length an in common with Y . If a > D and 8 < log(2fc — 
l)(a — D) then no relator r' ^ r has a subpath of length an in common with Y, 
with probability 0(e~" ). □ 



We deduce by Lemma [2. 2. 5 1 that a random group contains a surface subgroup at 



any D < 1/12. However, Theorem 6.1.3 already improves this to D < 2/7. 

To go further we need two ingredients — we need to generalize the condition of 
a-convexity to rule out the existence of relators with several short pieces in common 
with Y, and we need to control the dependence of different faces in a big diagram. 

Definition 6.2.3 ((m,a) convexity). Let G be a group with a hxed presentation, 
and let S be an oriented surface over the presentation with a folded spine Y. We 
say S is (m, a)-convex (for some integer m > and some real a > 0) with respect 
to the presentation if for every immersed path 7 in Y which contains m disjoint 
subpaths 7j which are disjoint pieces in some relation r, t with |7j|/|rj| > a, we 
actually have that 7 is contained in dS(Y) (i.e. it is in the boundary of a disk of 
S). 

Note that an (m, a)-convex surface is ma-convex, but the converse is not neces- 
sarily true. 

Lemma 6.2.4. Fix D. Then for any m,a with ma > D, a beaded surface S is 
(m,a)-convex with probability 1 — 0(e~ n ). 

Proof. The proof is essentially the same as that of Lemma |6.2.2| The probability 
that a random relator r' has a piece of length an in common with the spine Y is 
Q|y5n-iog(2fc-i)cm^ an( j ftie probability of having m disjoint pieces of that length 
is therefore O( e ™(<5n-log(2fc-i)an))_ So for ma > D the result follows as before. □ 

6.3. van Kampen disks. Our strategy will be to show that the existence of a 
certain kind of van Kampen disk T> with boundary a cyclically reduced word in Y 
essential in S gives rise to a contradiction. Suppose that 7 c Y is an essential loop 
in S whose image is trivial in G, so that there is some van Kampen diagram T> with 
boundary 7. If some face in T> has boundary r or r -1 , and if this face has more 
than en in common with 7, then this face agrees with a disk of S, and we can find 
a smaller van Kampen diagram T>' by pushing across this disk. So in the sequel we 
will only consider loops 7 C Y essential in S bounding van Kampen disks T> which 
cannot be simplified by such a move. We call such a van Kampen disk efficient. 
The following Lemma is standard. 

Lemma 6.3.1 (Short shortcut). Let G be a hyperbolic group with a presentation 
with respect to which it is 8 -hyperbolic. Let T be a cyclically reduced word in the 
generators which is trivial in G. Then there is a van Kampen disk T> with \dT>\ < 
186 and a connected subpath 7 C dT) with 7 C T and \j\ > \dT>\/2. 

Note that if 7' = dT) — 7 then |7'| < |7|. In other words, 7' is a shortcut; hence 
the terminology. 

Proof. In any 5-hyperbolic path metric space, for any k > 88, a fc-local geodesic 
(i.e. a 1-manifold for which every subpath of length at most k is a geodesic) is a 
(global) (f^lf ,2(5)-quasigeodesic; see 0, Ch. III. H, 1.13 p. 405. The loop T starts 
and ends at the same point, and is therefore not a fc-local geodesic for k > 98. 
Therefore some segment of length at most 98 is not geodesic, and it cobounds a 
van Kampen disk T) with an honest geodesic. □ 
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We deduce the following corollary: 

Lemma 6.3.2. Suppose that S is a beaded surface which is not ni-injective. Then 
there are constants C and C depending only on D < 1/2, a path 7 in the spine 
Y (not contained in dS(Y)) of length at most Cn, and a van Kampen diagram D 
containing at most C faces so that 7 c dT) and \j\ > \d"D\/2. 



Proof. Theorem 6.1.2 says that S < 4n/(l — 2D), so by Lemma 6.3.1 it follows that 



there is such a disk T) with boundary of length at most 72n/(l — 2D). On the 



other hand, by Theorem |6.1.l| we know 72n/(l - 2D) > \d1)\ > (1 - 2£> - e) • nC 
where C" is the number of faces; in particular, C" is bounded in terms of D (and 
independent of n) . □ 

The fact that C and C" can be chosen independent of n (but depending on 
D < 1/2 of course) is crucial for our purposes. 

6.4. Surfaces in random groups. We can now prove the main theorem of the 
paper. 

Theorem 6.4.1 (Surfaces in random groups). A random group of length n and 
density D < 1/2 contains a surface subgroup with probability 1 — 0(e~ n ). In fact, it 
contains 0(e n ) surfaces of genus 0(n). Moreover, these surfaces are quasiconvex. 

Proof. Pick one relation r and build a beaded surface S by the method of § [5] Wc 
have already shown that the 1-skeleton Y of S does not contain a path of length e-n 
in common with r or r _1 except for paths in the boundary of a disk, with the desired 



probability. By Lemma 6.2.4 the surface is (m, a)-convex for any 1/2 > ma > D. 



Suppose S is not 7Ti-injective. Then by Lemma 6.3.2 there is an efficient van 
Kampen disk with boundary an essential loop in Y, containing a subdisk T> with at 
most C" faces, and at least half of its boundary equal to some path 7 in the spine 
Y. Since the original disk was efficient, no face of T> with boundary label r or r" 1 
has more than en in common with 7. 

Fix a combinatorial type for the diagram. Then there are at most polynomial in 
n choices of edge lengths for the edges in the diagram. Choose a collection of edge 
lengths. Let m < C be the number of faces. 

We estimate the probability that there is a way to label each face with a relator or 
its inverse compatible with 7. We express the count in terms of degrees of freedom, 
measured multiplicatively, as powers of (2k — 1) (Gromov uses the terms density 
and codensity; see the discussion in [8] pp. 269-272 expanded at length in |13j). 

Each choice of face gives nD degrees of freedom, and each segment in the interior 
of length (. imposes I degrees of constraint. Similarly, the segment 7 imposes I7I 
degrees of constraint. Let J denote the union of interior edges. Then |D| +2| J| = nm 
so I7I + p| = nm/2 because I7I > \dT>\/2. On the other hand, the total degrees of 
freedom is nmD < nm/2, so no assignment is possible, with probability 1 — 0{eT n ). 
Summing the exceptional cases over the polynomial in n assignments of lengths, 
and the finite number of combinatorial diagrams, we see that S is injective, with 
probability 1 — 0(e~ n ) for some c depending on D (and going to as D — >• 1/2). 

In fact, the same argument implies that every essential path in Y with no long 



segment on dS(Y) is actually quasigeodesic, by the proof of Lemma 6.3.1 The 



theorem follows. □ 
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Remark 6.4.2. The surfaces we build are homologically trivial, since they map 
nontrivially over only one disk bounded by a relator r, and with total degree 0. It 
turns out that essentially the same method produces many homologically essential 
surfaces. 

If n is even, a random reduced word of length n in a free group of rank k is 
homologically trivial with probability 0(n~ k / 2 ). Since there are (2k—l) nD relators, 
there are an enormous number of such homologically trivial relators, and we can 
try to build a surface mapping over the associated disk with degree 1 (and therefore 
being homologically essential in G) . Evidently, the only obstruction to finding such 
surfaces is to build a bead decomposition as in Lemma |5. 2. 2 where all the Bi are 



homologically trivial, while still preserving the property that correlations between 
distinct Bi decay exponentially fast. The probability that the naive construction 
of a bead decomposition (as in the lemma) applied to a random word will have this 
property is (n~ k ' 2 ) n , which is subexponential in n, so many of the relators will 
have this property, and we can build many homologically essential surfaces of genus 
0(n). If n is odd we can build a similar (homologically essential) beaded surface 
from two (judiciously chosen) relators. 

Remark 6.4.3. The surfaces we build have genus 0(n) (or o(n) for rank 2), and 
it is natural to wonder if this is the best possible. We conjecture not; in fact we 
conjecture that the smallest genus injective surfaces in random groups are of genus 
0(n/ log n) (at any density D < 1/2). 

In fact, Thm. 4.16 of [3] lets us give a precise estimate of the geometry of the 
Gromov norm on H 2 (G;M.). Let V be the vector space with the relators n as 
basis, and let W be the kernel of the natural map V — > Hi(Fk;M.). Then we can 
identify W with H2(G;M), by Mayer- Vietoris. The vector space W inherits an L 1 
norm from V with respect to its given basis. The Gromov norm on W (on random 
subspaces of fixed dimension) is (with overwhelming probability) proportional to 
this L 1 norm, with constant of proportionality 21og(2fc — l)n/31og(n), up to a 
multiplicative error of size 1 + o(l) (there is a factor of 4 relative to the statement 
of Thm. 4.16 of [3]; this factor of 4 reflects the difference between the Gromov norm 
and the so-called scl norm). It seems plausible that classes in iJ 2 (G;Q) should be 
projectively represented by norm-minimizing surfaces; such surfaces will necessarily 
be injective. Again, it seems likely that one should not need to pass to a very big 
multiple of a class to find an extremal surface (at least for some classes); so there 
should be injective surfaces of genus 0(n/log(n)). We strongly suspect this order 
of magnitude is sharp. 
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